CLAIMS

1 1. A method of recognizing and indexing documents in a system having a scanner

2 (30) connected to a computer, the method comprising: 

3 scanning the documents; 

4 using a pointing device or member of the computer to designate an 

5 arbitrary point P in at least one box of the documents; 

6 recognizing by OCR the characters in said box; and 

7 storing the recognized characters in a first database connected to the 

8 computer to enable documents scanned in this way to be indexed. 
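
By way of non-limiting illustration, the storing step of claim 1 can be sketched as follows. The table name `doc_index`, the column names and the helper functions are hypothetical, and SQLite merely stands in for the first database, which the claims do not specify.

```python
import sqlite3

def open_index(path=":memory:"):
    """Open the index and create the (hypothetical) table if it is missing."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS doc_index ("
        "doc_id TEXT NOT NULL, field TEXT NOT NULL, value TEXT NOT NULL, "
        "PRIMARY KEY (doc_id, field))"
    )
    return conn

def store_recognized(conn, doc_id, field, value):
    """Record the characters recognized in one box of one scanned document."""
    conn.execute(
        "INSERT OR REPLACE INTO doc_index (doc_id, field, value) VALUES (?, ?, ?)",
        (doc_id, field, value),
    )
    conn.commit()

def find_documents(conn, field, value):
    """Return the ids of the documents whose box `field` holds `value`."""
    rows = conn.execute(
        "SELECT doc_id FROM doc_index WHERE field = ? AND value = ? ORDER BY doc_id",
        (field, value),
    )
    return [row[0] for row in rows]
```

Keying each entry on the document and the box name lets a later query retrieve every scanned document that carries a given recognized value.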

1 2. The method according to claim 1, wherein said designation step comprises 

2 searching for and identifying the box of the document which contains said point 

3 P designated by the user. 

1 3. The method according to claim 2, wherein said step of searching for and 

2 identifying said box is performed by applying a shape search algorithm over a 

3 determined search zone surrounding said point P as previously designated by 

4 the user. 
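
As an illustrative sketch only, the determined search zone of claim 3 may be taken as a square window centred on point P and clipped to the image. The function name and the `half_size` parameter are assumptions; the claims do not state how the zone is dimensioned.

```python
def search_zone(px, py, width, height, half_size=200):
    """Return (x0, y0, x1, y1), a window centred on P and clipped to the image.

    x1 and y1 are exclusive bounds. `half_size` is an assumed tuning parameter.
    """
    if not (0 <= px < width and 0 <= py < height):
        raise ValueError("point P lies outside the image")
    x0 = max(0, px - half_size)
    y0 = max(0, py - half_size)
    x1 = min(width, px + half_size + 1)
    y1 = min(height, py + half_size + 1)
    return x0, y0, x1, y1
```

Restricting the shape search to such a window keeps the cost of the search independent of the size of the scanned document.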

1 4. The method according to claim 3, wherein said shape search algorithm is a 

2 projection algorithm which counts the number of pixels present in each vertical 

3 or horizontal line of said determined search zone and which, on the basis of 

4 these count numbers, finds the horizontal and vertical lines present in said 

5 search zone by examining the peaks in the X and Y projection profiles. 
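
A minimal sketch of the projection algorithm of claim 4 is given below, assuming the search zone is a binarized image held as a list of rows in which 1 marks a dark pixel. The function names and the `min_fraction` threshold are hypothetical, and the sketch assumes that the ruled lines of the box span most of the zone.

```python
def projection_profiles(zone):
    """Count the set pixels in every row and every column of the zone."""
    row_counts = [sum(row) for row in zone]
    col_counts = [sum(col) for col in zip(*zone)]
    return row_counts, col_counts

def peak_positions(profile, full_length, min_fraction=0.8):
    """Indices whose count reaches `min_fraction` of a full-length line."""
    threshold = min_fraction * full_length
    return [i for i, count in enumerate(profile) if count >= threshold]

def enclosing_box(zone, px, py, min_fraction=0.8):
    """Return (left, top, right, bottom) of the ruled box around P, or None."""
    height, width = len(zone), len(zone[0])
    row_counts, col_counts = projection_profiles(zone)
    # Peaks in the row counts are horizontal lines; peaks in the column
    # counts are vertical lines.
    h_lines = peak_positions(row_counts, width, min_fraction)
    v_lines = peak_positions(col_counts, height, min_fraction)
    # Keep the nearest line on each side of P.
    top = max((y for y in h_lines if y < py), default=None)
    bottom = min((y for y in h_lines if y > py), default=None)
    left = max((x for x in v_lines if x < px), default=None)
    right = min((x for x in v_lines if x > px), default=None)
    if None in (top, bottom, left, right):
        return None
    return left, top, right, bottom
```

Here the coordinates of P are expressed relative to the search zone. A result of None indicates that no closed box was found around P with the chosen threshold.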

1 5. The method according to claim 3, wherein said shape search algorithm is an 

2 algorithm based on the Hough transform. 
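
For illustration, a straight-line Hough transform of the kind referred to in claim 5 can be sketched as follows, again assuming a binarized search zone held as a list of rows. The function name, the `min_votes` threshold and the angular step are assumptions and are not taken from the claims.

```python
import math
from collections import Counter

def hough_lines(zone, min_votes, theta_step_deg=1):
    """Vote in (theta, rho) space and return the lines with enough votes.

    Each line is (theta_deg, rho) with rho = x*cos(theta) + y*sin(theta),
    so theta = 0 is a vertical line at x = rho and theta = 90 is a
    horizontal line at y = rho.
    """
    trig = [
        (t, math.cos(math.radians(t)), math.sin(math.radians(t)))
        for t in range(0, 180, theta_step_deg)
    ]
    votes = Counter()
    for y, row in enumerate(zone):
        for x, pixel in enumerate(row):
            if pixel:
                # Every set pixel votes for all lines that pass through it.
                for t, c, s in trig:
                    votes[(t, round(x * c + y * s))] += 1
    return sorted(line for line, n in votes.items() if n >= min_votes)
```

With a fine angular step, neighbouring angles may also exceed the threshold for the same physical line, so a practical implementation would typically keep only local maxima of the accumulator. For ruled boxes aligned with the page, a step of 90 degrees restricts the search to vertical and horizontal lines.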

1 6. The method according to claim 1, wherein said OCR step is preceded by a step 

2 in which the user defines the type of character to be recognized in said box of the 

3 document. 

1 7. The method according to claim 1, wherein said scanning step is performed 

2 initially for a set of documents to be processed, with said steps of identifying the 
3 box and performing OCR being performed subsequently in succession for each of

4 the documents. 

1 8. The method according to claim 1, wherein said scanning step is initially 

2 performed for a first document, with said steps of identifying the box and 

3 performing OCR subsequently being performed on that document so as to define 

4 a sequence, with said sequence then being repeated in succession for each of the 

5 documents to be processed. 
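
The two orderings of claims 7 and 8 can be contrasted with the following sketch. The callables `scan`, `identify_box` and `ocr` are hypothetical placeholders supplied by the caller; no particular scanner interface or OCR engine is implied.

```python
def batch_then_recognize(pages, scan, identify_box, ocr):
    """Claim 7 ordering: scan the whole set first, then process each image."""
    images = [scan(page) for page in pages]
    return [ocr(image, identify_box(image)) for image in images]

def scan_and_recognize_each(pages, scan, identify_box, ocr):
    """Claim 8 ordering: scan, identify and recognize one document at a time."""
    results = []
    for page in pages:
        image = scan(page)
        results.append(ocr(image, identify_box(image)))
    return results
```

Both functions return the same recognized values; they differ only in whether all scanning is completed before the first box is identified.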

1 9. The method according to claim 1, wherein said documents to be recognized 

2 and indexed are a set of technical drawings of the same or different types. 

1 10. The method according to claim 1, wherein said documents to be recognized 

2 and indexed are a set of forms of the same or different types.

1 11. An apparatus for recognizing and indexing documents, the apparatus 

2 comprising: 

3 a scanner for scanning a document and delivering an image of the 

4 document; 

5 a computer connected to the scanner to receive said scan image; 

6 a first database connected to said computer for storing said scanned 

7 image; and 

8 first software for using a pointing device or member of the computer to 

9 designate an arbitrary point P in at least one box of the image, for searching for 

10 and identifying the box containing said point P designated by the user, for

11 recognizing by OCR the characters in said box, and for storing the recognized

12 characters so as to enable images scanned in this way to be indexed. 

1 12. The apparatus according to claim 11, further comprising a second database 

2 connected to the computer to store characterization data such that the box 

3 subsequently can be identified automatically by said software without any point 

4 P within said box being designated. 
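
One possible reading of claim 12 is sketched below, purely as an illustration. It assumes that the characterization data consist of the box coordinates stored per document type and per box name; the claims do not define the content of these data, and the class and method names are hypothetical.

```python
class BoxTemplates:
    """In-memory stand-in for the second database of claim 12."""

    def __init__(self):
        self._boxes = {}

    def remember(self, doc_type, field, box):
        """Store the box found once after the user designated point P."""
        self._boxes[(doc_type, field)] = tuple(box)

    def recall(self, doc_type, field):
        """Return the stored box, or None if the user must still designate P."""
        return self._boxes.get((doc_type, field))
```

Once a box has been remembered for a document type, later documents of that type can be processed without the user designating a point.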

1 13. The apparatus according to claim 11, further comprising second

2 software for defining the type of data to be recognized in said document box. 
1 14. The apparatus according to claim 12, wherein the first and second databases

2 are integrated in the memory of the computer. 

1 15. The apparatus according to claim 11, wherein said pointing member is the 

2 keyboard of the computer or a finger of the user. 

1 16. A computer-readable medium having embodied thereon software to be 

2 processed by a computer connected so as to receive a scanned image, the 

3 software being operable to cause said computer to perform the functions of the 

4 first software of claim 11.



